SEQUENCE LISTING 

(1) GENERAL INFORMATION : 
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(2) INFORMATION FOR SEQ ID NO: 1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Gly Glu Ser Asp Gly Gly Tyr Met Asp Met Ser Lys 
15 10 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Asp Met Ser Lys Asp Glu Ser Val Asp Tyr Val Pro Met Leu Asp Met 
15 10 15 

Lys 



;3 (2) INFORMATION FOR SEQ ID NO: 3: 

^ (i) SEQUENCE CHARACTERISTICS: 

3 (A) LENGTH: 11 amino acids 

" (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Cys Asp Glu Ser Val Asp Tyr Val Pro Met Leu-, 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Ala Arq Asp He Met Arg Asp Ser Asn Tyr He Ser Lys Gly Ser Thr 
1 5 10 15 

Phe 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Glu Phe Cys Pro Asp Pro Leu Tyr Glu Val Met Leu Lys 
3 1 5 10 

j (2) INFORMATION FOR SEQ ID NO: 6: 

I (i) SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH: 16 amino acids 

i (B) TYPE: amino acid 

1 (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

% 

* (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



3 Arcr Arq Phe Thr Ser Thr Glu Pro Gin Tyr Gin Pro Gly Glu Asn Leu 

3 1 5 ' 10 . I 5 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Arq Arq Leu He Glu Asp Asn Glu Tyr Thr Ala Arg Gly 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Thr Leu Ala Tyr Pro Val Tyr Ala Gin Gin Arg Arg 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
-3 (D) TOPOLOGY: linear 

IU (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

H Thr Leu Ala His Pro Val Arg Ala Pro Gly Pro Gly Pro Pro Ala Ala 
Q 1 5 10 15 

I. M ' 

5l_ Arg , 

0(2) INFORMATION FOR SEQ ID NO: 10: 

5.5 

% (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Tyr Xaa Xaa Met 
1 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



= 1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



Met Asp Trp lie Phe His Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AARATGGAYT GGATHTTYCA YAC 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Asp Asp Gly pin Leu Phe His He Asp Phe Gly His 
1 5 10 



;3 (2) INFORMATION FOR SEQ ID NO: 14: 

-J 

=3 (i) SEQUENCE CHARACTERISTICS: 

.2 (A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GATGATGGCC ARCTGTTYCA YATWGAYTTT GGCCA 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
AATTCACACA CTGGCATGCC GAT 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
GACTCGAGTC GACATCGATT TTTTTTTTTT TTTTT 



'(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 
i TTTAAGCTTA GGCATTCTAA AGTCACTATC ATCCC 



\ (2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
GACTCGAGTC GACATCGA 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 



CAGGCCTGGC TTCCTGT 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid - 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
AACCAGGCTC AACTGTT 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
5 TGCTGTAAAT TCTAATGCTG 



5(2) INFORMATION FOR SEQ ID NO: 22: 

^ (i) SEQUENCE CHARACTERISTICS: 

3 (A) LENGTH: 20 base pairs 

3 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GTATTTCATG AAACAAATGA 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 




(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
AAGGATCAGA ACAATGCCT 



3 (2) INFORMATION FOR SEQ ID NO: 25: 

4 (i) SEQUENCE CHARACTERISTICS: 

t (A) LENGTH: 19 base pairs 

=j (B) TYPE: nucleic acid 

i (C) STRANDEDNESS: single 

i CD) TOPOLOGY: linear 



3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

n ■ 

3 AGGCTTTCTT TAGCCATCA 



3 (2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Gly Asp Asp Leu Arg Gin Asp 
1 5 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Phe His He Asp Phe Gly His Phe 
1 5 



^(2) INFORMATION FOR SEQ ID NO: 29: 

ry 

lf\ (i) SEQUENCE CHARACTERISTICS: 

□ (A) LENGTH: 23 base pairs 

;1 (B) TYPE: nucleic acid 

\n (C) STRANDEDNESS: single 

V (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
Glu Glu Glu Glu Glu Tyr Met Pro Met Xaa 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 



GGNGAYGAYY TRCGNCARGA 



RAARTGCCRA ARTCDATRTG RAA 




(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Asp Asp Asp Asp Asp Val 
1 5 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single or double 

(D) TOPOLOGY: linear 



.j 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..3204 

(D) OTHER INFORMATION: /standard_name= "CDS" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



ATG CCT CCA AGA CCA TCA TCA GGT GAA CTG TGG GGC ATC CAC TTG ATG 48 
Met Pro Pro Arg Pxo Ser Ser Gly Glu Leu Trp Gly lie His Leu Met 
3 1 5 10 15 

*% 

1CCC CCA AGA ATC CTA GTG GAA TGT TTA CTA CCA AAT GGA ATG ATA GTG 96 
Spro Pro Arg lie Leu Val Glu Cys Leu Leu Pro Asn Gly Met lie Val 
~ 20 25 30 

ACT TTA GAA TGC CTC CGT GAG GCT ACA TTA GTA ACT ATA AAG CAT GAA 144 
Thr Leu Glu Cys Leu Arg Glu Ala Thr Leu Val Thr lie Lys His Glu 
35 40 45 

CTA TTT AAA GAA GCA AGA AAA TAC CCT CTC CAT CAA CTT CTT CAA GAT 192 
Leu Phe Lys Glu Ala Arg Lys Tyr Pro Leu His Gin Leu Leu Gin Asp 
50 55 60 

GAA TCT TCT TAC ATT TTC GTA AGT GTT ACC CAA GAA GCA GAA AGG GAA 240 
Glu Ser Ser Tyr lie Phe Val Ser Val Thr Gin Glu Ala Glu Arg Glu 
65 70' 75 80 

GAA TTT TTT GAT GAA ACA AGA CGA CTT TGT GAT CTT CGG CTT TTT CAA 288 
Glu Phe Phe Asp Glu Thr Arg Arg Leu Cys Asp Leu Arg Leu Phe Gin 

85 90 95 



CCA TTT TTA AAA GTA ATT GAA CCA GTA GGC AAC CGT GAA GAA AAG ATC 336 
Pro Phe Leu Lys Val lie Glu Pro Val Gly Asn Arg Glu Glu Lys lie 
100 105 110 

CTC AAT CGA GAA ATT GGT TTT GCT ATC GGC ATG CCA GTG TGC GAA TTT 384 
Leu Asn Arg Glu lie Gly Phe Ala lie Gly Met Pro Val Cys Glu Phe 
115 - 120 125 

GAT ATG GTT AAA GAT CCT GAA GTA CAG GAC TTC CGA AGA AAT ATT CTT 432 
Asp Met Val Lys Asp Pro Glu Val Gin Asp Phe Arg Arg Asn lie Leu 
130 135 140 

AAT GTT TGT AAA GAA GCT GTG GAT CTT AGG GAT CTT AAT TCA CCT CAT 480 
Asn Val Cys Lys Glu Ala Val Asp Leu Arg Asp Leu Asn Ser Pro His 
145 150 155 160 

mGT AGA GCA ATG TAT GTC TAT CCG CCA CAT GTA GAA TCT TCA CCA GAG 528 
3Ser Arg Ala Met Tyr Val Tyr Pro Pro His Val Glu Ser Ser Pro Glu 
■J 165 170 175 

; =CTG CCA AAG CAC ATA TAT AAT AAA TTG GAT AGA GGC CAA ATA ATA GTG 576 
SLeu Pro Lys His lie Tyr Asn Lys Leu Asp Arg Gly Gin lie lie Val 
'% 180 185 190 

:=GTG ATT TGG GTA ATA GTT TCT CCA AAT AAT GAC AAG CAG AAG TAT ACT 624 
::Val lie Trp Val lie Val Ser Pro Asn Asn Asp Lys Gin Lys Tyr Thr 
^ 195 200 205 

1 

3CTG AAA ATC AAC CAT GAC TGT GTG CCA GAA CAA GTA ATT GCT GAA GCA 672 

Leu Lys lie Asn His Asp Cys Val Pro Glu Gin Val lie Ala Glu Ala 
210 215 220 

ATC AGG AAA AAA ACT AGA AGT ATG TTG CTA TCA TCT GAA CAA TTA AAA 720 
lie Arg Lys Lys Thr Arg Ser Met Leu Leu Ser Ser Glu Gin Leu Lys 
225 230 235 240 

CTC TGT GTT TTA GAA TAT CAG GGC AAG TAC ATT TTA AAA GTG TGT GGA 768 
Leu Cys Val Leu Glu Tyr Gin Gly Lys Tyr lie Leu Lys Val Cys Gly 

245 250 255 

TGT GAT GAA TAC TTC CTA GAA AAA TAT CCT CTG AGT CAG TAT AAG TAT 816 
Cys Asp Glu Tyr Phe Leu Glu Lys Tyr Pro Leu Ser Gin Tyr Lys Tyr 
260 265 270 




ATA 
lie 


AGA 
Arg 


AGC 
Ser 
275 


TGT 
Cys 


ATA 
He 


ATG 
Met 


CTT 
Leu 


GGG 
Gly 
280 


AGG 
Arg 


ATG 
Met 


CCC 
Pro 


AAT 
Asn 


TTG 
Leu 
285 


AAG 
Lys 


ATG 
Met 


ATG 
Met 


864 


GCT 
Ala 


AAA 
Lys 
290 


GAA 
Glu 


AGC 
Ser 


CTT 
Leu 


TAT 
Tyr 


TCT 
Ser 
295 


CAA 
Gin 


CTG 
Leu 


CCA 
Pro 


ATG 
Met 


GAC 
Asp 
300 


TGT 
Cys 


TTT 
Phe 


ACA 
Thr 


ATG 
Met 


912 


CCA 
Pro 
305 


TCT 
Ser 


TAT 
Tyr 


TCC 
Ser 


AGA CGC 
Arg Arg 
310 


ATT 
He 


TCC 
Ser 


ACA 
Thr 


GCT 
Ala 


ACA 
Thr 
315 


CCA 
Pro 


TAT 
Tyr 


ATG 
Met 


AAT GGA 
Asn Gly 
320 


960 


GAA 
Glu 


ACA 
Thr 


TCT 
Ser 


ACA 
Thr 


7\ H 

AAA 
Lys 
325 


TCC 

Ser 


CTT 
Leu 


TGG 
Trp 


GTT 
Val 


ATA 
lie 
330 


AAT 
Asn 


AGA 
Arg 


GCA 
Ala 


CTC 
Leu 


AGA 
Arg 
335 


ATA 
He 


1008 


;3AAA 

■ii 


ATT 
lie 


CTT 
Leu 


TGT 
Cys 
340 


GCA 
Ala 


ACC 
Thr 


TAC 
Tyr 


GTG 
Val 


AAT 
Asn 
345 


CTA 
Leu 


AAT 
Asn 


ATT 
lie 


CGA 
Arg 


GAC 
Asp 
350 


ATT 
lie 


GAC 
Asp 


1056 


:=Jaag 
iLys 


ATT 
He 


TAT 
Tyr 
355 


GTT 
Val 


CGA 
Arg 

#* 


ACA 
Thr 


GGT 
Gly 


ATC 
He 
360 


TAC 
Tyr 


CAT 
His 


GGA 
Gly 


GGA 
Gly 


GAA 
Glu 
365 


CCC 
Pro 


TTA 
Leu 


TGT 
Cys 


1104 


;?GAC 
^Asp 

;3 
.j 


AAT 
Asn 
370 


GTG 
Val 


AAC 
Asn 


ACT 
Thr 


CAA 
Gin 


AGA 
Arg 
375 


GTA 
Val 


CCT 
Pro 


TGT 
Cys 


TCC 
Ser 


AAT 
Asn 
380 


CCC 
Pro 


AGG 
Arg 


TGG AAT 
Trp Asn 


1152 


JGAA 
Glu 
385 


TGG 
Trp 


CTG 
Leu 


AAT 
Asn 


TAT GAT 
Tyr Asp 
390 


ATA 
lie 


TAC 
Tyr 


ATT 
lie 


CCT 
Pro 


GAT 
Asp 
395 


CTT 
Leu 


CCT 
Pro 


CGT 
Arg 


GCT 
Ala 


GCT 
Ala 
400 


1200 


CGA 
Arg 


CTT 
Leu 


TGC 
Cys 


CTT 
Leu 


TCC 
Ser 
405 


ATT 
He 


TGC 
Cys 


TCT 
Ser 


GTT 
Val 


AAA 
Lys 
410 


GGC 
Gly 


CGA 
Arg 


AAG 
Lys 


GGT 
Gly 


GCT 
Ala 
415 


AAA 
Lys 


1248 


GAG 
Glu 


GAA 
Glu 


CAC 
His 


TGT 
Cys 
420 


CCA 
Pro 


TTG 
Leu 


GCA 
Ala 


TGG 
Trp 


GGA 
Gly 
425 


AAT 
Asn 


ATA 
He 


AAC 
Asn 


TTG 
Leu 


TTT 
Phe 
430 


GAT 
Asp 


TAC 
Tyr 


1296 


ACA 
Thr 


GAC 
Asp 


ACT 

ml 

Thr 
435 


CTA 
Leu 


GTA 

Val 


TCT 
Ser 


GGA 
Gly 


AAA 
Lys 
440 


ATG 
Met 


GCT 

TV -I _ 

Ala 


TTG 
Leu 


AAT 
Asn 


CTT 
Leu 
445 


TGG 
Trp 


CCA 
Pro 


GTA 
Val 


1344 


CCT 


CAT 


GGA 


TTA 


GAA 


GAT 


TTG 


CTG 


AAC 


CCT 


ATT 


GGT 


GTT 


ACT 


GGA 


TCA 


1392 



Pro His Gly Leu Glu Asp Leu Leu Asn Pro lie Gly Val Thr Gly Ser 
450 455 460 

AAT CCA AAT AAA GAA ACT CCA TGC TTA GAG TTG GAG TTT GAC TGG TTC 1440 
Asn Pro Asn Lys Glu Thr Pro Cys Leu Glu Leu Glu Phe Asp Trp Phe 
465 470 475 480 

AGC AGT GTG GTA AAG TTC CCA GAT ATG TCA GTG ATT GAA GAG CAT GCC 1488 
Ser Ser Val Val Lys Phe Pro Asp Met Ser Val lie Glu Glu His Ala 

485 490 495 

AAT TGG TCT GTA TCC CGA GAA GCA GGA TTT AGC TAT TCC CAC GCA GGA 153 6 

Asn Trp Ser Val Ser Arg Glu Ala Gly Phe Ser Tyr Ser His Ala Gly 
500 505 510 

= CTG AGT AAC AGA CTA GCT AGA GAC AAT GAA TTA AGG GAA AAT GAC AAA 1584 
=Leu Ser Asn Arg Leu Ala Arg Asp Asn Glu Leu Arg Glu Asn Asp Lys 
3 515 520 525 



iGAA CAG CTC AAA GCA ATT TCT ACA CGA GAT CCT CTC TCT GAA ATC ACT . 1632 
3-Glu Gin Leu Lys Ala lie Ser Thr Arg Asp Pro Leu Ser Glu He Thr 
3 530 535 540 

f| 

iGAG CAG GAG AAA GAT TTT CTA TGG AGT CAC AGA CAC TAT TGT GTA ACT 1680 
!J Glu Gin Glu Lys Asp Phe Leu Trp Ser His Arg His Tyr Cys Val Thr 
^545 - 550 555 560 

d 

?ATC CCC GAA ATT CTA CCC AAA TTG CTT CTG TCT GTT AAA TGG AAT TCT 1728 

J Ile Pro Glu He Leu Pro Lys Leu Leu Leu' Ser Val Lys Trp Asn Ser 

565 570 575 

AGA GAT GAA GTA GCC CAG ATG TAT TGC TTG GTA AAA GAT TGG CCT CCA 1776 
Arg Asp Glu Val Ala Gin Met Tyr Cys Leu Val Lys Asp Trp Pro Pro 
580 585 590 

ATC AAA CCT GAA CAG GCT ATG GAA CTT CTG GAC TGT AAT TAC CCA GAT 1824 
He Lys Pro Glu Gin Ala Met Glu Leu Leu Asp Cys Asn Tyr Pro Asp 
595 600 605 

CCT ATG GTT CGA GGT TTT GCT GTT CGG TGC TTG GAA AAA TAT TTA ACA 1872 
Pro Met Val Arg Gly Phe Ala Val Arg Cys Leu Glu Lys Tyr Leu Thr 
610 615 620 



GAT GAC AAA CTT TCT CAG TAT TTA ATT CAG CTA GTA CAG GTC CTA AAA 1920 
Asp Asp Lys Leu Ser Gin Tyr Leu lie Gin Leu Val Gin Val Leu Lys 
625 630 635 640 

TAT GAA CAA TAT TTG GAT AAC TTG CTT GTG AGA TTT TTA CTG AAG AAA 1968 
Tyr Glu Gin Tyr Leu Asp Asn Leu Leu Val Arg Phe Leu Leu Lys Lys 

645 650 655 

GCA TTG ACT AAT CAA AGG ATT GGG CAC TTT TTC TTT TGG CAT TTA AAA 2016 
Ala Leu Thr Asn Gin Arg lie Gly His Phe Phe Phe Trp His Leu Lys 
660 665 670 

TCT GAG ATG CAC AAT AAA ACA GTT AGC CAG AGG TTT GGC CTG CTT TTG 2064 
Ser Glu Met His Asn Lys Thr Val Ser Gin Arg Phe Gly Leu Leu Leu 
675 680 685 



= GAG TCC TAT TGT CGT GCA TGT GGG ATG TAT TTG AAG CAC CTG AAT AGG 2112 
IgIu Ser Tyr Cys Arg Ala Cys Gly Met Tyr Leu Lys His Leu Asn Arg 
690 695 700 

t CAA GTC GAG GCA ATG GAA AAG CTC ATT AAC TTA ACT GAC ATT CTC AAA 2160 
:'Gln Val Glu Ala Met Glu Lys Leu lie Asn Leu Thr Asp lie Leu Lys 
1 705 , 710 715 720 

1 

^ CAG GAG AGG AAG GAT GAA ACA CAA AAG GTA CAG ATG AAG TTT TTA GTT 2208 

3 Gin Glu Arg Lys Asp Glu Thr Gin Lys Val Gin Met Lys Phe Leu Val 

J 725 730 735 

~ GAG CAA ATG AGG CGA CCA GAT TTC ATG GAT GCC CTA CAG GGC TTG CTG 2256 
Glu Gin Met Arg Arg Pro Asp Phe Met Asp Ala Leu Gin Gly Leu Leu 
740 • 745 750 

TCT CCT CTA AAC CCT GCT CAT CAA CTA GGA AAC CTC AGG CTT AAA GAG 2304 
Ser Pro Leu Asn Pro Ala His Gin Leu Gly Asn Leu Arg Leu Lys Glu 
755 760 765 

TGT CGA ATT ATG TCT TCT GCA AAA AGG CCA CTG TGG TTG AAT TGG GAG 2352 
Cys Arg lie Met Ser Ser Ala Lys Arg Pro Leu Trp Leu Asn Trp Glu 
770 775 780 

AAC CCA GAC ATC ATG TCA GAG TTA CTG TTT CAG AAC AAT GAG ATC ATC 2400 
Asn Pro Asp lie Met Ser Glu Leu Leu Phe Gin Asn Asn Glu lie lie 
785 790 795 800 



TTT AAA AAT GGG GAT GAT TTA CGG CAA GAT ATG CTA ACA CTT CAA ATT 2448 
Phe Lys Asn Gly Asp Asp Leu Arg Gin Asp Met Leu Thr Leu Gin lie 

805 810 815 

ATT CGT ATT ATG GAA AAT ATC TGG CAA AAT CAA GGT CTT GAT CTT CGA 2496 
lie Arg lie Met Glu Asn lie Trp Gin Asn Gin Gly Leu Asp Leu Arg 
820 825 830 

ATG TTA CCT TAT GGT TGT CTG TCA ATC GGT GAC TGT GTG GGA CTT ATT 2544 
Met Leu Pro Tyr Gly Cys Leu Ser lie Gly Asp Cys Val Gly Leu lie 
835 840 845 

GAG GTG GTG CGA AAT TCT CAC ACT ATT ATG CAA ATT CAG TGC AAA GGC 2592 
Glu Val Val Arg Asn Ser His Thr lie Met Gin lie Gin Cys Lys Gly 
850 855 860 

OgGC TTG AAA GGT GCA CTG CAG TTC AAC AGC CAC ACA CTA CAT CAG TGG 2640 
UGly Leu Lys Gly Ala Leu Gin Phe Asn Ser His Thr Leu His Gin Trp 
W865 870 875 880 

m 

pCTC AAA GAC AAG AAC AAA GGA GAA ATA TAT GAT GCA GCC ATT GAC CTG 2688 

= 3 Leu Lys Asp Lys Asn Lys Gly Glu lie Tyr Asp Ala Ala lie Asp Leu 
\fl 885 890 895 

f^TTT ACA CGT TCA TGT GCT GGA TAC TGT GTA GCT ACC TTC ATT TTG GGA 2736 
•;^Phe Thr Arg Ser Cys Ala Gly Tyr Cys Val Ala Thr Phe lie Leu Gly 
\i 900 905 910 

"•: =? 

^ ATT GGA GAT CGT CAC AAT AGT AAC ATC ATG GTG AAA GAC GAT GGA CAA 2784 

lie Gly Asp Arg His Asn Ser Asn lie Met Val Lys Asp Asp Gly Gin 
915 920 925 

CTG TTT CAT ATA GAT TTT GGA CAC TTT TTG GAT CAC AAG AAG AAA AAA 2832 
Leu Phe His lie Asp Phe Gly His Phe Leu Asp His Lys Lys Lys Lys 
930 935 940 

TTT GGT TAT AAA CGA GAA CGT GTG CCA TTT GTT TTG ACA CAG GAT TTC 2880 
Phe Gly Tyr Lys Arg Glu Arg Val Pro Phe Val Leu Thr Gin Asp Phe 
945 950 955 960 

. TTA ATA GTG ATT AGT AAA GGA GCC CAA GAA TGC ACA AAG ACA AGA GAA 2928 
Leu He Val He Ser Lys Gly Ala Gin Glu Cys Thr Lys Thr Arg Glu 

965 970 975 



TTT GAG AGG TTT CAG GAG ATG TGT TAC AAG GCT TAT CTA GCT ATT CGA 
Phe Glu Arg Phe Gin Glu Met Cys Tyr Lys Ala Tyr Leu Ala He Arg 
980 985 990 



2976 



CAG CAT GCC AAT CTC TTC ATA AAT CTT TTC TCA ATG ATG CTT GGC TCT 3024 
Gin His Ala Asn Leu Phe He Asn Leu Phe Ser Met Met Leu Gly Ser 
995 1000 1005 

GGA ATG CCA GAA CTA CAA TCT TTT GAT GAC ATT GCA TAC ATT CGA AAG 3 072 

Gly Met Pro Glu Leu Gin Ser Phe Asp Asp He Ala Tyr He Arg Lys 
1010 1015 1020 

ACC CTA GCC TTA GAT AAA ACT GAG CAA GAG GCT TTG GAG TAT TTC ATG 3120 
Thr Leu Ala Leu Asp Lys Thr Glu Gin Glu Ala Leu Glu Tyr Phe Met 
102 5 1030 1035 1040 

AAA CAA ATG AAT GAT GCA CAT CAT GGT GGC TGG ACA ACA AAA ATG GAT 3168 
=>Lvs Gin Met Asn Asp Ala His His Gly Gly Trp Thr Thr Lys Met Asp 
3 y 1045 1050 1055 

y 

^TGG ATC TTC CAC ACA ATT AAA CAG CAT GCA TTG AAC TGAAAGATAA 3214 
= Trp He Phe His Thr He Lys Gin His Ala Leu Asn 
^ 1060 1065 

^ CTGAGAAAAT GAAAGCJCAC TCTGGACACT ACACTGCACT GTTAATAACT CTCAGCAGGC 3274 
? AAAGACCGAT TGCATAGGAA TTGCACAATC CATGAACAGC ATTAGATTTA CAGCAAGAAC 3334 
^ AGAAATAAAA TACTATATAA TTTAAATAAT GTAAACGCAA ACAGGGTTTG ATAGCACTTA 33 94 
3 AACTAGTTCA TTTCAAAA 



3412 



(2) INFORMATION FOR SEQ ID NO: 33:. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 868 amino acids residues 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Asn He Thr Phe Cys Val Ser Gin Asp Leu Asp Val Pro Leu Lys Val 
1 5 10 15 



Lvs He Lys Ser Leu Glu Gly His Lys Pro Leu Leu Lys Pro Ser Gin 

20 25 30 

Lys He Leu Asn Pro Glu Leu Met Leu He Gly Ser Asn Val Phe Pro 
35 40 45 



Ser Ser Asp Leu He Val Ser Leu Gin Val Phe Asp Lys Glu Arg Asn 
50 55 60 

Arq Asn Leu Thr Leu Pro He Tyr Thr Pro Tyr He Pro Phe Arg Asn 
65 y 70 75 - 80 

Ser Arg Thr Trp Asp Tyr Trp Leu Thr Leu Pro He Arg He Lys Gin 

85 90 95 

Leu Thr Phe Ser Ser His Leu Arg He lie Leu Trp Glu Tyr Asn Gly 
100 105 HO 

Ser Lys Gin He Pro Phe Phe Asn Leu Glu Thr Ser He Phe Asn Leu 
115 120 125 

Lys Asp Cys Thr Leu Lys Arg Gly Phe Glu Ser Leu Lys Phe Arg Tyr 
130 135 140 



Asp Val He Asp His Cys Glu Val Val Thr Asp Asn Lys Asp Gin Glu 
145 150 155 160 

Asn Leu Asn £ys Tyr Phe Gin Gly Glu Phe Thr Arg Leu Pro Trp Leu 



165 



170 



Asp Glu He Thr He Ser Lys Leu Arg Lys Gin Arg Glu Asn Arg Thr 
180 185 190 

Trp Pro Gin Gly Thr Phe Val Leu Asn Leu Glu Phe Pro Met Leu Glu 
195 200 205 

Leu Pro Val Val Phe He Glu Arg Glu He Met Asn Thr Gin Met Asn 
210 215 220 

lie Pro Thr Leu Lys Asn Asn Pro Gly Leu Ser -/Thr Asp Leu Arg Glu 
225 230 235 240 

Pro Asn Arg Asn Asp Pro Gin He Lys He Ser Leu Gly Asp Lys Tyr 

245 250 255 



His Ser Thr Leu Lys Phe Tyr Asp Pro Asp Gin Pro Asn Asn Asp ?ro 

265 270 



260 



He Glu Glu Lys Tyr Arg Arg Leu Glu Arg Ala Ser Lys Asn Ala Asn 
275 280 285 



Leu Asp Lys Gin Val Lys Pro Asp He Lys Lys Arg Asp Tyr Leu Asn 
290 295 300 

Lys lie He Asn Tyr Pro Pro Gly Thr Lys Leu Thr Ala His Glu Lys 
305 310 315 320 

Gly Ser He Trp Lys Tyr Arg Tyr Tyr Leu Met Asn Asn Lys Lys Ala 

325 330 335 

Leu Thr Lys Leu Leu Gin Ser Thr Asn Leu Arg Glu Glu Ser Glu Arg 
- 340 345 350 

Val Glu Val Leu Glu Leu Met Asp Ser Trp Ala Glu He Asp lie Asp 
355 360 365 

Asp Ala Leu Glu Leu Leu Gly Ser Thr Phe Lys Asn Leu Ser Val Arg 
370 375 380 



Ser Tyr Ala Val Asn Arg Leu Lys Lys Ala Ser Asp Lys Glu Leu Glu 
385 , 390 395 400 

Leu Tyr Leu Leu Gin Leu Val Glu Ala Val Cys Phe Glu Asn Leu Ser 

405 410 415 



Thr Phe Ser Asp Lys Ser Asn Ser Glu Phe Thr He Val Asp Ala Val 
420 425 430 

Ser Ser Gin Lys Leu Ser Gly Asp Ser Met Leu Leu Ser Thr Ser His 
435 440 445 

Ala Asn Gin Lys Leu Leu Lys Ser He Ser Ser Glu Ser Glu Thr Ser 
450 455 460 



Gly Thr Glu Ser 
465 

He Arg Arg Ala 



Leu Pro He Val 
470 

Leu Val Asn Pro 
485 



He Ser Pro Leu 
475 

Arg Leu Gly Ser 
490 



Ala Glu Phe Leu 
480 

Phe Phe Tyr Trp 
495 



Leu Lys Ser Glu Ser Glu Asp Lys Pro Tyr Leu Asp Gin He Leu 
500 505 510 



Ser Ser Phe Trp Ser Arg Leu Asp Lys Lys Ser Arg Asn lie Leu Asn 
515 520 525 



Asp Gin Val Arg Leu He Asn Val Leu Arg Glu Cys Cys Glu Thr He 
530 535 540 

Lys Arg Leu Lys Asp Thr Thr Ala Lys Lys Met Glu Leu Leu Val His 
545 550 555 560 

Leu Leu Glu Thr Lys Val Arg Pro Leu Val Lys Val Arg Pro He Ala 

565 570 575 

Leu Pro Leu Asp Pro Asp Val Leu He Cys Asp Val Cys Pro Glu Thr 
580 585 590 

Ser Lys Val Phe Lys Ser Ser Leu Ser Pro Leu Lys He Thr Phe Lys 
595 600 605 

Thr Thr Leu Asn Gin Pro Tyr His Leu Met Phe Lys Val Gly Asp Asp 
610 615 620 

Leu Arg Gin Asp Gin Leu Val Val Gin He He Ser Leu Met Asn Glu 
625 ' 630 635 640 

Leu Leu Lys Asn Glu Asn Val Asp Leu Lys Leu Thr Pro Tyr Lys He 

645 650 655 

Leu Ala Thr Gly Pro Gin Glu Gly Ala lie Glu Phe He Pro Asn Asp 
660 665 670 

Thr Leu Ala Ser He Leu Ser Lys Tyr His Gly He Leu Gly Tyr Leu 
675 680 685 

Lys Leu His Tyr Pro Asp Glu Asn Ala Thr Leu Gly Val Gin Gly Trp 
690 695 -700 

Val Leu Asp Asn Phe Val Lys Ser Cys Ala Gly Tyr Cys Val He Thr 
705 710 715 720 

Tyr He Leu Gly Val Gly Asp Arg His Leu Asp Asn Leu Leu Val Thr 

725 730 735 



Pro Asp Gly His Phe Phe His Ala Asp Phe Gly Tyr He Leu Gly Gin 
740 745 750 



J 



Asp Pro Lys Pro Phe Pro Pro Leu Met Lys Leu Pro Pro Gin lie lie 
755 760 765 



Glu Ala Phe Gly Gly Ala Glu Ser Ser Asn Tyr Asp Lys Phe Arg Ser 
770 775 780 

Tyr Cys Phe Val Ala Tyr Ser lie Leu Arg Arg Asn Ala Gly Leu lie 
785 790 795 800 

Leu Asn Leu Phe Glu Leu Met Lys Thr Ser Asn lie Pro Asp He Arg 

805 810 815 



He Asp Pro Asn Gly Ala He Leu Arg Val Arg Glu Arg Phe Asn Leu 
820 825 830 



Asn Met Ser Glu Glu Asp Ala Thr Val His Phe Gin Asn Leu He Asn 
835 840 845 



Asp Ser Val Asn Ala Leu Leu Pro He Val He Asp His Leu His Asn 
850 855 860 



Leu Ala Gin Tyr 
865 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



ATGCCTCCAA 


GACCATCATC 


AGGTGAACTG 


TGGGGCATCC 


ACTTGATGCC 


CCCAAGAATC 


60 


CTAGTGGAAT 


GTTTACTACC 


AAATGGAATG 


ATAGTGACTT 


TAGAATGCCT 


CCGTGAGGCT 


120 


ACATTAGTAA 


CTATAAAGCA 


TGAACTATTT 


AAAGAAGCAA 


GAAAATACCC 


TCTCCATCAA 


180 


CTTCTTCAAG 


ATGAATCTTC 


TTACATTTTC 


GTAAGTGTTA 


CCCAAGAAGC 


AGAAAGGGAA 


240 


GAATTTTTTG 


ATGAAACAAG 


ACGACTTTGT 


GATCTTCGGC 


TTTTTCAACC 


ATTTTTAAAA 


300 


GTAATTGAAC 


CAGTAGGCAA 


CCGTGAAGAA 


AAGATCCTCA 


ATCGAGAAAT 


TGGTTTTGCT 


360 



ATCGGCATGC CAGTGTGCGA ATTTGATATG GTTAAAGATC CTGAAGTACA GGACTTCCGA 420 

AGAAATATTC TTAATGTTTG TAAAGAAGCT GTGGATCTTA GGGATCTTAA TTCACCTCAT 480 

AGTAGAGCAA TGTATGTCTA TCCGCCACAT GTAGAATCTT CACCAGAGCT GCCAAAGCAC 540 

ATATATAATA AATTGGATAG AGGCCAAATA ATAGTGGTGA TTTGGGTAAT AGTTTCTCCA 600 

AATAATGACA AGCAGAAGTA TACTCTGAAA ATCAACCATG ACTGTGTGCC AGAACAAGTA 660 

ATTGCTGAAG CAATCAGGAA AAAAACTAGA AGTATGTTGC TATCATCTGA ACAATTAAAA 720 

CTCTGTGTTT TAGAATATCA GGGCAAGTAC ATTTTAAAAG TGTGTGGATG TGATGAATAC 780 

TTCCTAGAAA AATATCCTCT GAGTCAGTAT AAGTATATAA GAAGCTGTAT AATGCTTGGG 840 

AGGATGCCCA ATTTGAAGAT GATGGCTAAA GAAAGCCTTT ATTCTCAACT GCCAATGGAC 900 

TGTTTTACAA TGCCATCTTA TTCCAGACGC ATTTCCACAG CTACACCATA TATGAATGGA 960 

^ AAAC AT CTA CAAAATCCCT TTGGGTTATA AATAGAGCAC TCAGAATAAA AATTCTTTGT 1020 

licAACCTATG TGAATGTAAA TATTCGAGAC ATTGACAAGA TTTATGTTCG AACAGGTATC 1080 

!: ^TACCATGGAG GAGAACCCTT ATGTGACAAT GTGAACACTC AAAGAGTACC TTGTTCCAAT 1140 

[jcCCAGGTGGA ATGAATGGCT GAATTATGAT ATATACATTC CTGATCTTCC TCGTGCTGCT 1200 

^CGACTTTGCC TTTCCAfTTG CTCTGTTAAA GGCCGAAAGG GTGCTAAAGA GGAACACTGT 1260 

EjjjCCATTGGCAT GGGGAAATAT AAACTTGTTT GATTACACAG ACACTCTAGT ATCTGGAAAA 1320 

;:3 ATGGCTTTGA ATCTTTGGC C AGTACCTCAT GGATTAGAAG ATTTGCTGAA CCCTATTGGT 1380 

^GTTACTGGAT CAAATCCAAA TAAAGAAACT CCATGCTTAG AGTTGGAGTT TGACTGGTTC 1440 

AGCAGTGTGG TAAAGTTCCC AGATATGTCA GTGATTGAAG AGCATGCCAA TTGGTCTGTA 1500 

TCCCGAGAAG CAGGATTTAG CTATTCCCAC GCAGGACTGA GTAACAGACT AGCTAGAGAC 1560 

AATGAATTAA GGGAAAATGA CAAAGAACAG CTCAAAGCAA TTTCTACACG AGATCCTCTC 1620 

TCTGAAATCA CTGAGCAGGA GAAAGATTTT CTATGGAGTC ACAGACACTA TTGTGTAACT 1680 

ATCCCCGAAA TTCTACCCAA ATTGCTTCTG TCTGTTAAAT GGAATTCTAG AGATGAAGTA 1740 

GCCCAGATGT ATTGCTTGGT AAAAGATTGG CCTCCAATCA AACCTGAACA GGCTATGGAA 1800 

CTTCTGGACT GTAATTACCC AGATCCTATG GTTCGAGGTT TTGCTGTTCG GTGCTTGGAA 1860 

AAATATTTAA CAGATGACAA ACTTTCTCAG TATTTAATTC AGCTAGTACA GGTCCTAAAA 1920 

TATGAACAAT ATTTGGATAA CTTGCTTGTG AGATTTTTAC TGAAGAAAGC ATTGACTAAT 1980 

CAAAGGATTG GGCACTTTTT CTTTTGGCAT TTAAAATCTG AGATGCACAA TAAAACAGTT 2040 




AGCCAGAGGT TTGGCCTGCT TTTGGAGTCC 


TATTGTCGTG 


CATGTGGGAT 


GTATTTGAAG 


2100 


CACCTGAATA GGCAAGTCGA GGCAATGGAA 


AAGCTCATTA 


ACTTAACTGA 


CATTCTCAAA 


Z±o\J 


CAGGAGAGGA AGGATGAAAC ACAAAAGGTA 


CAGATGAAGT 


TTTTAGTTGA GCAAATGAGG 




CGACCAGATT TCATGGATGC CCTACAGGGC 


TTGCTGTCTC 


CTCTAAACCC 


TGCTCATCAA 




CTAGGAAACC TCAGGCTTAA AGAGTGTCGA ATTATGTCTT 


CTGCAAAAAG 


GCCACTGTGG 




TTGAATTGGG AGAACCCAGA CATCATGTCA 


GAGTTACTGT 


TTCAGAACAA 


TGAGATCATC 


£>*k U U 


TTTAAAAATG GGGATGATTT ACGGCAAGAT ATGCTAACAC 


TTCAAATTAT 


TCGTATTATG 




GAAAAJ.A1L.1 cbLAfinnlwi /iwlLlluni 


CTTCGAATGT 


TACCTTATGG 


TTGTCTGTCA 




ATCGGTGACT GTGTGGGACT TATTGAGGTG 


GTGCGAAATT 


CTCACACTAT 


TATGCAAATT 




CAGTGCAAAG GCGGCTTGAA AGGTGCACTG 


CAGTTCAACA 


GCCACACACT 


ACATCAGTGG 


o e a n 


\ CTCAAAGACA AGAACAAAGG AGAAATATAT 


GATGCAGCCA 


TTGACCTGTT 


TACACGTTCA 


o *7 n o. 


J TGTGCTGGAT ACTGTGTAGC TACCTTCATT 


TTGGGAATTG 


GAGATCGTCA 


CAATAGTAAC 




y 

F ! ATCATGGTGA AAGACGATGG ACAACTGTTT 


CATATAGATT 


TTGGACACTT 


TTTGGATCAC 


^ o ^ w 


3 AAGAAGAAAA AATTTGGTTA TAAACGAGAA 


CGTGTGCCAT 


TTGTTTTGAC 


ACAGGATTTC 


o q q n 


TTAATAGTGA TTAGTA^AGG AGCCCAAGAA 


TGCACAAAGA 


CAAGAGAATT 


TCjACjA^jo ill 


z?4U 


1 

S CAGGAGATGT GTTACAAGGC TTATCTAGCT 


ATTCGACAGC 


ATGCCAATCT 


CTTCATAAAT 




1 

j CTTTTCTCAA TGATGCTTGG CTCTGGAATG 


CCAGAACTAC 


AATCTTTTGA 


TGACATTGCA 


jUoU 


^ TACATTCGAA AGACCCTAGC CTTAGATAAA ACTGAQCAAG 


AGGCTTTGGA 


GTATTTCATG 


3120 


AAACAAATGA ATGATGCACA TCATGGTGGC 


TGGACAACAA 


AAATGGATTG 


GATCTTCCAC 


3180 


ACAATTAAAC AGCATGCATT GAACTGAAAG 


ATAACTGAGA AAATGAAAGC 


TCACTCTGGA 


3240 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
ATGCCTCCAA GACCATCATC AGGTGAACTG TGGGGCATCC ACTTGATGCC CCCAAGAATC 
CTAGTAGAAT GTTTACTACC AAATGGGATG ATAGTGACTT TAGAATGCCT CCGTGAGGCT 



180 
240 



ACGTTAATAA CGATAAAGCA TGAACTATTT AAAGAAGCAA GAAAATACCC TCTCCATCAA 
CTTCTTCAAG ATGAATCTTC TTACATTTTC GTAAGTGTTA CCCAAGAAGC AGAAAGGGAA 

GAATTTTTTG ATGAAACAAG ACGACTTTGT GACCTTCGGC TTTTTCAACC CTTTTTAAAA 300 

GTAATTGAAC CAGTAGGCAA CCGTGAAGAA AAGATCCTCA ATCGAGAAAT TGGTTTTGCT 360 

ATCGGCATGC CAGTGTGTGA ATTCGATATG GTTAAAGATC CAGAAGTACA GGACTTCCGA 420 

AGAAATATTC TCAATGTTTG TAAAGAAGCT GTGGATCTTA GGGATCTTAA TTCACCTCAT 480 

AGTAGAGCAA TGTATGTTTA TCCTCCAAAT GTAGAATCTT CACCAGAACT GCCAAAGCAC 540 

ATATATAATA AATTGGATAA AGGGCAAATA ATAGTGGTGA TTTGGGTAAT AGTTTCTCCA 60C 

AATAATGACA AACAGAAGTA TACTCTGAAA ATCAACCATG ACTGTGTGCC AGAACAAGTA 66C 

ATTGCTGAAG CAATCAGGAA AAAAACTCGA AGTATGTTGC TATCATCTGA ACAACTAAAA 72 C 

3CTCTGTGTTT TAGAATATCA GGGCAAGTAT ATTTTAAAAG TGTGTGGATG TGATGAATAC 78( 

3TTCCTAGAAA AATATCCTCT GAGTCAGTAT AAGTATATAA GAAGCTGTAT AATGCTTGGG 84 ( 

JaGGATGCCCA ATTTGATGCT GATGGCTAAA GAAAGCCTCT ATTCTCAACT GCCAATGGAC 90( 

?TGTTTTACAA TGCCATCATA TTCCAGACGC ATCTCCACAG CTACGCCATA TATGAATGGA 961 

^ GAAACATCTA CAAAATfCCT TTGGGTTATA AATAGTGCAC TCAGAATAAA AATTCTTTGT 102 < 

3GCAACCTATG TGAATGTAAA TATTCGAGAC ATTGACAAGA TTTATGTTCG AACAGGTATC 108 

3 TACCATGGAG GAGAACCCTT ATGTGATAAT GTGAACACTC AAAGAGTACC TTGTTCCAAT 114 

3 CCCAGGTGGA ATGAATGGCT GAATTACGAT ATATACATTC CTGATCTTCC TCGTGCTGCT 120 

3 CGACTTTGCC TTTCCATTTG TTCTGTTAAA GGCCGAAAGG GTGCTAAAGA GGAACACTGT 126 

CCATTGGCCT GGGGAAATAT AAACTTGTTT GATTACACAG ATACTCTAGT ATCTGGAAAA 132 

ATGGCTTTGA ATCTTTGGCC AGTACCTCAT GGACTAGAAG ATTTGCTGAA CCCTATTGGT 138 

GTTACTGGAT CAAATCCAAA TAAAGAAACT CCATGTTTAG AGTTGGAGTT TGACTGGTTC 144 

AGCAGTGTGG TAAAGTTTCC AGATATGTCA GTGATTGAAG AGCAT.GCCAA TTGGTCTGTA 150 

TCCCGTGAAG CAGGATTTAG TTATTCCCAT GCAGGACTGA GTAACAGACT AGCTAGAGAC 15€ 

AATGAATTAA GAGAAAATGA TAAAGAACAG CTCCGAGCAA TTTGTACACG AGATCCTCTA 163 

TCTGAAATCA CTGAGCAAGA GAAAGATTTT CTGTGGAGCC ACAGACACTA TTGTGTAACT 16J 

ATCCCCGAAA TTCTACCCAA ATTGCTTCTG TCTGTTAAAT GGAACTCTAG AGATGAAGTA 17^ 

GCTCAGATGT ACTGCTTGGT AAAAGATTGG CCTCCAATCA AGCCTGAACA GGCTATGGAG 181 




CTTCTGGACT GCAATTACCC AGATCCTATG GTTCGAGGTT TTGCTGTTCG GTGCTTAGAA 1860 

AAATATTTAA CAGATGACAA ACTTTCTCAG TACCTAATTC AGCTAGTACA GGTACTAAAA 1920 

TATGAACAGT ATTTGGATAA CCTGCTTGTG AGATTTTTAC TCAAAAAAGC GTTAACTAAT 1980 

CAAAGGATCG GTCACTTTTT CTTTTGGCAT TTAAAATCTG AGATGCACAA TAAAACAGTT 2040 

AGTCAGAGGT TTGGCCTGCT TTTGGAGTCC TATTGCCGTG CATGTGGGAT GTATCTGAAG 2100 

CACCTTAATA GGCAAGTTGA GGCTATGGAA AAGCTCATTA ACTTGACTGA CATTCTCAAA 2160 

CAAGAGAAGA AGGATGAAAC ACAAAAGGTA CAGATGAAGT TTTTAGTTGA GCAAATGCGG 2220 

CGACCAGATT TCATGGATGC TCTCCAGGGC TTTCTGTCTC CTCTAAACCC TGCTCATCAG 2280 

CTGGGAAATC TCAGGCTTGA AGAGTGTCGA ATTATGTCTT CTGCAAAAAG GCCACTGTGG 2340 

TTGAATTGGG AGAACCCAGA CATCATGTCA GAATTACTCT TTCAGAACAA TGAGATCATC 2400 

5 TTTAAAAATG GGGATGATTT ACGGCAAGAT ATGCTAACCC TTCAGATTAT TCGCATTATG 2460 

SgAAAATATCT GGCAAAATCA AGGTCTTGAT CTTCGAATGT TACCTTATGG ATGTCTGTCA 2520 

SaTCGGTGACT GTGTGGGACT TATCGAGGTG GTGAGAAATT CTCACACTAT AATGCAGATT 2580 

.3 CAGTGTAAAG GAGGCCTGAA AGGTGCACTG CAGTTTAACA GCCACACACT CCATCAGTGG 2640 

1 CTCAAAGACA AGAACAAGGG GGAAATATAT GATGCGGCCA TCGATTTGTT TACACGATCA 2700 

I TGTGCTGGAT ATTGTGTTGC CACCTTCATT TTGGGAATTG GAGATCGTCA CAATAGTAAT 2760 

3 ATCATGGTTA AAGATGATGG ACAACTGTTT CATATAGATT TTGGACACTT TTTGGATCAC 2820 
? AAGAAGAAAA AATTTGGTTA TAAACGAGAG CGCGTGCCGT TTGTTTTGAC ACAAGATTTC 2880 
" TTAATAGTGA TTAGTAAAGG AGCCCAAGAA TGCACAAAGA CAAGAGAATT TGAGAGGTTT 2940 
CAGGAGATGT GTTACAAGGC TTATCTAGCT ATTCGGCAGC ATGCCAATCT CTTCATAAAT 3000 
CTTTTCTCAA TGATGCTTGG CTCTGGAATG CCAGAACTGC AATCTTTTGA TGATATTGCA 3 060 
TACATTCGAA AGACCCTAGC TTTAGATAAA ACTGAGCAAG AGGCTTTGGA GTATTTCATG 3120 
AAACAAATGA ATGATGCACA CCATGGTGGC TGGACAACAA AAATGGATTG GATCTTCCAC 3180 
ACAATTAAGC AGCATGCTTT GAACTGA 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1080 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



3207 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Met Pro Pro Arg Pro Ser Ser Gly Glu Leu Trp Gly He His Leu Met 
1 5 10 15 

Pro Pro Arg He Leu Val Glu Cys Leu Leu Pro Asn Gly Met He Val 
20 25 30 

Thr Leu Glu Cys Leu Arg Glu Ala Thr Leu Val Thr He Lys His Glu 
35 40 45 

Leu Phe Lys Glu Ala Arg Lys Tyr Pro Leu His Gin Leu Leu Gin Asp 
50 55 60 



■2 

=5 



Glu Ser Ser Tyr He Phe Val Ser Val Thr Gin Glu Ala Glu Arg Glu 
65 70 75 80 

Glu Phe Phe Asp Glu Thr Arg Arg Leu Cys Asp Leu Arg Leu Phe Gin 

85 90 95 

Pro Phe Leu Lys Val He Glu Pro Val Gly Asn Arg Glu Glu Lys He 
100 105 HO 



Leu Asn Arg Glu He Gly Phe Ala He Gly Met Pro Val Cys Glu Phe 
115 120 125 

Asp Met Val Lys Asp Pro Glu Val Gin Asp Phe Arg Arg Asn He Leu 
130 135 140 



Asn Val Cys Lys Glu Ala Val Asp Leu Arg Asp Leu Asn Ser Pro His 
145 150 155 160 

Ser Arq Ala Met Tyr Val Tyr Pro Pro His Val Glu Ser Ser Pro Glu 

165 170 175 

Leu Pro Lys His lie Tyr Asn Lys Leu Asp Arg Gly Gin He lie Val 
180 185 190 

Val He Trp Val lie Val Ser Pro Asn Asn Asp Lys Gin Lys Tyr Thr 
195 200 205 



Leu Lys lie Asn His Asp Cys Val Pro Glu Gin Val He Ala Glu Ala 
210 215 220 



Iff 



lie Arg Lys Lys Arg Ser Met Leu Leu Ser Ser Glu Gin Leu Lys 

225 230 235 240 

Leu Cys Val Leu Glu Tyr Gin Gly Lys Tyr lie Leu Lys Val Cys Gly 

245 250 255 



Cys Asp Glu Tyr Phe Leu Glu Lys Tyr Pro Leu Ser Gin Tyr Lys Tyr 
260 265 270 



lie Arg Ser Cys lie Met Leu Gly Arg Met Pro Asn Leu Lys Met Met 
275 280 285 

Ala Lys Glu Ser Leu Tyr Ser Gin Leu Pro Met Asp Cys Phe Thr Met 
290 295 300 



Pro Ser Tyr Ser Arg Arg He Ser Thr Ala Thr Pro Tyr Met Asn Gly 
305 310 315 320 

Glu Thr Ser Thr Lys Ser Leu Trp Val He Asn Arg Ala Leu Arg He 

325 330 335 

Lys He Leu Cys Ala Thr Tyr Val Asn Leu Asn He Arg Asp He Asp 
340 345 350 

Lys He Tyr Val Arg Thr Gly He Tyr His Gly Gly Glu Pro Leu Cys 
355 360 365 

Asp Asn Val Asn Thr Gin Arg Val Pro Cys Ser Asn Pro Arg Trp Asn 
370 375 380 , 

Glu Trp Leu Asn Tyr Asp He Tyr He Pro Asp Leu Pro Arg Ala Ala 
385 390 395 400 

Arg Leu Cys Leu Ser He Cys Ser Val Lys Gly Arg Lys Gly Ala Lys 

405 410 415 

Glu Glu His Cys Pro Leu Ala Trp Gly Asn He Asn Leu Phe Asp Tyr 
420 425 430 

Thr Asp Thr Leu Val Ser Gly Lys Met Ala Leu Asn Leu Trp Pro Val 
435 440 445 



Pro His Gly Leu Glu Asp Leu Leu Asn Pro He Gly Val Thr Gly Ser 
450 455 460 



Asn Pro Asn Lys Glu Thr Pro Cys Leu Glu Leu Glu Phe Asp Trp Phe 
465 470 475 480 



Ser Ser Val Val Lys Phe Pro Asp Met Ser Val He Glu Glu His Ala 

485 490 495 



Asn Trp Ser Val Ser Arg Glu Ala Gly Phe Ser Tyr Ser His Ala Gly 
500 505 510 



Leu Ser Asn Arg Leu Ala Arg Asp Asn Glu Leu Arg Glu Asn Asp Lys 
515 520 525 



Glu Gin Leu Lys Ala He Ser Thr Arg Asp Pro Leu Ser Glu He Thr 
530 535 540 



Glu Gin Glu Lys Asp Phe Leu Trp Ser His Arg His Tyr Cys Val Thr 

545 550 555 560 

He Pro Glu lie Leu Pro Lys Leu Leu Leu Ser Val Lys Trp Asn Ser 

565 570 575 



Arg Asp Glu yal Ala Gin Met Tyr Cys Leu Val Lys Asp Trp Pro Pro 
580 585 590 



He Lys Pro Glu Gin Ala Met Glu Leu Leu Asp Cys Asn Tyr Pro Asp 
595 600 605 



Pro Met Val Arg Gly Phe Ala Val Arg Cys Leu Glu Lys Tyr Leu Thr 
610 - 615 620 



Asp Asp Lys Leu Ser Gin Tyr Leu He Gin Leu Val Gin Val Leu Lys 
625 630 635 640 

Tyr Glu Gin Tyr Leu Asp Asn Leu Leu Val Arg Phe Leu Leu Lys Lys 

645 650 v 655 



Ala Leu Thr Asn Gin Arg He Gly His Phe Phe Phe Trp His Leu Lys 
660 665 670 



Ser Glu Met His Asn Lys Thr Val Ser Gin Arg Phe Gly Leu Leu Leu 
675 680 685 



Glu Ser Tyr Cys Arg Ala Cys Gly Met Tyr Leu Lys His Leu Asn Arg 
690 695 700 

Gin Val Glu Ala Met Glu Lys Leu He Asn Leu Thr Asp He Leu Lys 
705 710 715 720 

Gin Glu Arg Lys Asp Glu Thr Gin Lys Val Gin Met Lys Phe Leu Val 

725 730 735 

Glu Gin Met Arg Arg Pro Asp Phe Met Asp Ala Leu Gin Gly Leu Leu 
740 745 750 

Ser Pro Leu Asn Pro Ala His Gin Leu Gly Asn Leu Arg Leu Lys Glu 
755 760 765 

Cys Arg He Met Ser Ser Ala Lys Arg Pro Leu Trp Leu Asn Trp Glu 
770 775 780 

Asn Pro Asp He Met Ser Glu Leu Leu Phe Gin Asn Asn Glu He He 
785 790 795 800 

Phe Lys Asn Gly Asp Asp Leu Arg Gin Asp Met Leu Thr Leu Gin He 

805 810 815 

He Arg He Met Glu Asn He Trp Gin Asn Gin Gly Leu Asp Leu Arg 
820 825 830 

Met Leu Pro Tyr Gly Cys Leu Ser He Gly Asp Cys Val Gly Leu He 
835 840 845 

Glu Val Val Arg Asn Ser His Thr He Met Gin He Gin Cys Lys Gly 
850 .855 860 

Gly Leu Lys Gly Ala Leu Gin Phe Asn Ser His Thr Leu His Gin Trp 

« t > 880 



865 



870 



875 



Leu Lys Asp Lys Asn Lys Gly Glu He Tyr Asp Ala Ala He Asp Leu 

885 890 895 

Phe Thr Arg Ser Cys Ala Gly Tyr Cys Val Ala Thr Phe He Leu Gly 
900 905 910 



He Gly Asp Arg His Asn Ser Asn He Met Val Lys Asp Asp Gly Gin 
915 920 925 



Leu Phe His lie Asp Phe Gly His Phe Leu Asp His Lys Lys Lys Lys 
930 935 940 

Phe Gly Tyr Lys Arg Glu Arg Val Pro Phe Val Leu Thr Gin Asp Phe 
945 950 955 960 

Leu He Val He Ser Lys Gly Ala Gin Glu Cys Thr Lys Thr Arg Glu 

965 970 975 

Phe Glu Arg Phe Gin Glu Met Cys Tyr Lys Ala Tyr Leu Ala He Arg 
980 985 990 

Gin His Ala Asn Leu Phe He Asn Leu Phe Ser Met Met Leu Gly Ser 
995 1000 1005 

Gly Met Pro Glu Leu Gin Ser Phe Asp Asp He Ala Tyr He Arg Lys 
1010 1015 1020 

Thr Leu Ala Leu Asp Lys Thr Glu Gin Glu Ala Leu Glu Tyr Phe Met 
1025 1030 1035 1040 

Lys Gin Met Asn Asp Ala His His Gly Gly Trp Thr Thr Lys Met Asp 

1045 1050 1055 

Trp He Phe His Thr lie Lys Gin His Ala Leu Asn Xaa Lys He Thr 
1060 1065 1070 

Glu Lys Met Lys Ala His Ser Gly 
1075 1080 

INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1069 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Met Pro Pro Arg Pro Ser Ser Gly Glu Leu Trp Gly He His Leu Met 
15 10 15 



Pro Pro Arg He Leu Val Glu Cys Leu Leu Pro Asn Gly Met He Val 
20 25 30 



* ♦ 

Thr Leu Glu Cys Leu Arg Glu Ala Thr Leu lie Thr lie Lys His Glu 
35 40 45 



Leu Phe Lys Glu Ala Arg Lys Tyr Pro Leu His Gin Leu Leu Gin Asp 
50 55 60 

Glu Ser Ser Tyr lie Phe Val Ser Val Thr Gin Glu Ala Glu Arg Glu 
65 70 75 80 

Glu Phe Phe Asp Glu Thr Arg Arg Leu Cys Asp Leu Arg Leu Phe Gin 

85 90 95 

Pro Phe Leu Lys Val He Glu Pro Val Gly Asn Arg Glu Glu Lys He 
100 105 110 

Leu Asn Arg Glu He Gly Phe Ala He Gly Met Pro Val Cys Glu Phe 
115 .120 125 

Asp Met Val Lys Asp Pro Glu Val Gin Asp Phe Arg Arg Asn He Leu 
130 135 140 

Asn Val Cys Lys Glu Ala Val Asp Leu Arg Asp Leu Asn Ser Pro His 
145 „ 150 155 160 

Ser Arg Ala Met Tyr Val Tyr Pro Pro Asn Val Glu Ser Ser Pro Glu 

165 170 175 

Leu Pro Lys His He Tyr Asn Lys Leu Asp Lys Gly Gin He He Val 
180 185. 190 

Val He Trp Val He Val Ser Pro Asn Asn Asp Lys Gin Lys Tyr Thr 
195 200 205 

Leu Lys He Asn His Asp Cys Val Pro Glu Gin Val He Ala Glu Ala 
210 215 220 

He Arq Lys Lys Thr Arg Ser Met Leu Leu Ser Ser Glu Gin Leu Lys 
225 230 235 240 

Leu Cys Val Leu Glu Tyr Gin Gly Lys Tyr He Leu Lys Val Cys Gly 

245 250 255 



Cys Asp Glu Tyr Phe Leu Glu Lys Tyr Pro Leu Ser Gin Tyr Lys Tyr 
260 265 270 



He Arg Ser Cys He Met Leu Gly Arg Met Pro Asn Leu Met Leu Met 
275 280 285 



Ala Lys Glu Ser Leu Tyr Ser Gin Leu Pro Met Asp Cys Phe Thr Met 
290 295 300 



Pro Ser Tyr Ser Arg Arg He Ser Thr Ala Thr Pro Tyr Met Asn Gly 
305 310 315 320 

Glu Thr Ser Thr Lys Ser Leu Trp Val He Asn Ser Ala Leu Arg He 

325 330 335 



Lys He Leu Cys Ala Thr Tyr Val Asn Val Asn He Arg Asp He Asp 
340 345 350 



Lys He Tyr Val Arg Thr Gly He Tyr His Gly Gly Glu Pro Leu Cys 
355 360 365 



Asp Asn Val Asn Thr Gin Arg Val Pro Cys Ser Asn Pro Arg Trp Asn 
370 375 380 



Glu Trp Leu £sn Tyr Asp He Tyr He Pro Asp Leu Pro Arg Ala Ala 
385 390 395 400 

Arg Leu Cys Leu Ser He Cys Ser Val Lys Gly Arg Lys Gly Ala Lys 

405 410 415 



Glu Glu His Cys Pro Leu Ala Trp Gly Asn He Asn Leu Phe Asp Tyr 
420 425 430 

Thr Asp Thr Leu Val Ser Gly Lys Met Ala Leu Asn Leu Trp Pro Val 
435 440 445 



Pro His Gly Leu Glu Asp Leu Leu Asn Pro He Gly Val Thr Gly Ser 
450 455 -460 



Asn Pro Asn Lys Glu Thr Pro Cys Leu Glu Leu Glu Phe Asp Trp Phe 
465 470 475 480 

Ser Ser Val Val Lys Phe Pro Asp Met Ser Val lie Glu Glu His Ala 

485 490 495 



Asn Trp Ser Val Ser Arg Glu Ala Gly Phe Ser Tyr Ser His Ala Gly 
500 505 510 



■ IS 



Leu Ser Asn Arg Leu Ala Arg Asp Asn Glu Leu Arg Glu Asn Asp Lys 
515 520 525 

Glu Gin Leu Arg Ala He Cys Thr Arg Asp Pro Leu Ser Glu He Thr 
530 * 535 540 

Glu Gin Glu Lys Asp Phe Leu Trp Ser His Arg His Tyr Cys Val Thr 
545 550 555 560 

He Pro Glu He Leu Pro Lys Leu Leu Leu Ser Val Lys Trp Asn Ser 

565 570 575 

Arg Asp Glu Val Ala Gin Met Tyr Cys Leu Val Lys Asp Trp Pro Pro 
580 585 590 

He Lys Pro Glu Gin Ala Met Glu Leu Leu Asp Cys Asn Tyr Pro Asp 
595 600 605 

Pro Met Val Arg Gly Phe Ala Val Arg Cys Leu Glu Lys Tyr Leu Thr 
610 615 620 



Asp Asp Lys Leu Ser Gin Tyr Leu He Gin Leu Val Gin Val Leu Lys 
625 630 635 640 

Tyr Glu Gin Tyr Leu Asp Asn Leu Leu Val Arg Phe Leu Leu Lys Lys 

645 650 655 

Ala Leu Thr Asn Gin Arg He Gly His Phe Phe Phe Trp His Leu Lys 
660 665 670 

Ser Glu Met His Asn Lys Thr Val Ser Gin Arg Phe Gly Leu Leu Leu 
675 680 685 

Glu Ser Tyr Cys Arg Ala Cys Gly Met Tyr Leu 'Lys His Leu Asn Arg 
690 695 700 

Gin Val Glu Ala Met Glu Lys Leu He Asn Leu Thr Asp He Leu Lys 
705 710 715 720 

Gin Glu Lys Lys Asp Glu Thr Gin Lys Val Gin Met Lys Phe Leu Val 

725 730 735 



Glu Gin Met Arg Arg Pro Asp Phe Met Asp Ala Leu Gin Gly Phe Leu 
740 745 750 



Ser Pro Leu Asn Pro Ala His Gin Leu Gly Asn Leu Arg Leu Glu Glu 
755 760 765 



Cys Arg He Met Ser Ser Ala Lys Arg Pro Leu Trp Leu Asn Trp Glu 
770 775 780 



Asn Pro Asp He Met Ser Glu Leu Leu Phe Gin Asn Asn Glu He He 
785 790 795 800 

Phe Lys Asn Gly Asp Asp Leu Arg Gin Asp Met Leu Thr Leu Gin He 

805 810 815 



He Arq He Met Glu Asn He Trp Gin Asn Gin Gly Leu Asp Leu Arg 
820 825 830 



Met Leu Pro Tyr Gly Cys Leu Ser He Gly Asp Cys Val Gly Leu He 
835 840 845 

Glu Val Val Arg Asn Ser His Thr He Met Gin He Gin Cys Lys Gly 
850 , 855 860 



Gly Leu Lys Gly Ala Leu Gin Phe Asn Ser His Thr Leu His Gin Trp 
865 870 875 880 



870 



Leu Lys Asp Lys Asn Lys Gly Glu lie Tyr Asp Ala Ala lie Asp Leu 
2 890 895 



885 



Phe Thr Arg Ser Cys Ala Gly Tyr Cys Val Ala Thr Phe He Leu Gly 
900 905 910 



905 



He Gly Asp Arg His Asn Ser Asn He Met Val Lys Asp Asp Gly Gin 

920 925 



915 



Leu Phe His He Asp Phe Gly His Phe Leu Asp His Lys Lys Lys Lys 
930 935 940 



Phe Gly Tyr Lys Arg Glu Arg Val Pro Phe Val Leu Thr Gin Asp^Phe 



945 



950 



955 



Leu lie Val He Ser Lys Gly Ala Gin Glu Cys Thr Lys Thr Arg Glu 

965 970 975 




Phe Glu Arg Phe Gin Glu Met Cys Tyr Lys Ala Tyr Leu Ala He Arg 
980 985 990 

Gin His Ala Asn Leu Phe He Asn Leu Phe Ser Met Met Leu Gly Ser 
995 1000 1005 

Glv Met Pro Glu Leu Gin Ser Phe Asp Asp He Ala Tyr He Arg Lys 
1010 1015 1020 

Thr Leu Ala Leu Asp Lys Thr Glu Gin Glu Ala Leu Glu Tyr Phe Met 
1025 1030 1035 1040 

Lys Gin Met Asn Asp Ala His His Gly Gly Trp Thr Thr Lys Met Asp 
J 1045 1050 1055 

Trp He Phe His Thr He Lys Gin His Ala Leu Asn Xaa 
1060 1065 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 base pairs 

(B) TYPE: nucleic acid 

(C) STRjANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..3 81 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

GGA GAC GAC TTG CGA CAG GAT CAA CTT ATT CTT CAA ATC ATT TCA CTC 48 
Gly Asp Asp Leu Arg Gin Asp Gin Leu He Leu Gin He He Ser Leu 
1 5 10 15 



ATG GAC AAG CTG TTA CGG AAA GAA AAT CTG GAC TTG AAA TTG ACA CCT 
Met Asp Lys Leu Leu Arg Lys Glu Asn Leu Asp Leu Lys Leu Thr Pro 
20 25 30 

TAT AAG GTG TTA GCC ACC AGT ACA AAA CAT GGC TTC ATG CAG TTT ATC 
Tyr Lys Val Leu Ala Thr Ser Thr Lys His Gly Phe Met Gin Phe He 
35 40 45 

CAG TCA GTT CCT GTG GCT GAA GTT CTT GAT ACA GAG GGA AGC ATT CAG 
Gin Ser Val Pro Val Ala Glu Val Leu Asp Thr Glu Gly Ser He Gin 

55 60 



96 



144 



192 



AAC TTT TTT AGA AAA TAT GCA CCA AGT GAG AAT GGG CCA AAT GGG ATT 240 
Asn Phe Phe Arg Lys Tyr Ala Pro Ser Glu Asn Gly Pro Asn Gly lie 
65 70 75 80 

AGT GCT GAG GTC ATG GAC ACT TAC GTT AAA AGC TGT GCT GGA TAT TGC 288 
Ser Ala Glu Val Met Asp Thr Tyr Val Lys Ser Cys Ala Gly Tyr Cys 

85 90 95 

GTG ATC ACC TAT ATA CTT GGA GTT GGA GAC AGG CAC CTG GAT AAC CTT 336 
Val lie Thr Tyr lie Leu Gly Val Gly Asp Arg His Leu Asp Asn Leu 
100 105 110 

TTG CTA ACC AAA ACA GGC AAA CTC TTC CAC ATC GAT TTC GGC CAC 381 
Leu Leu Thr Lys Thr Gly Lys Leu Phe His lie Asp Phe Gly His 
115 120 125 

3 (2) INFORMATION FOR SEQ ID NO: 39: 

y (i) SEQUENCE CHARACTERISTICS: 
rj (A) LENGTH: 127 amino acids 

g (B) TYPE: amino acid 

3 (D) TOPOLOGY: linear 

(ii) MOLECULE" TYPE : protein 

~i 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

-Gly Asp Asp Leu Arg Gin Asp Gin Leu lie Leu Gin lie lie Ser Leu 

I 1 5 10 . 15 

Met Asp Lys Leu Leu Arg Lys Glu Asn Leu Asp Leu Lys Leu Thr Pro 
20 25 30 

Tyr Lys Val Leu Ala Thr Ser Thr Lys His Gly Phe Met Gin Phe lie 
35 40 45 

Gin Ser Val Pro Val Ala Glu Val Leu Asp Thr Glu Gly Ser lie Gin 
50 55 60 

Asn Phe Phe Arg Lys Tyr Ala Pro Ser Glu Asn Gly Pro Asn Gly lie 
65 70 75 80 

Ser Ala Glu Val Met Asp Thr Tyr Val Lys Ser Cys Ala Gly Tyr Cys 

85 90 95 




Val He Thr Tyr He Leu Gly Val Gly Asp Arg His Leu Asp Asn Leu 
100 105 110 



Leu Leu Thr Lys Thr Gly Lys Leu Phe His He Asp Phe Gly His 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .393 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

■ j 

• 3 GGG GAT GAC TTA CGG CAG GAC ATG CTA ACG CTG CAG ATG ATT CGC ATC 48 
£ Gly Asp Asp Leu Arg Gin Asp Met Leu Thr Leu Gin Met He Arg He 
y l 5 10 15 



3 ATG AGC AAG ATC TGG GTC CAG GAG GGG CTG GAC ATG CGC ATG GTC ATC 96 
1 Met Ser Lys He Trp Val Gin Glu Gly Leu Asp Met Arg Met Val He 
20 25 30 



'i TTC CGC TGC TTC TCC ACC GGC CGG GGC AGA GGG ATG GTG GAG ATG ATC 144 
1 Phe Arg Cys Phe Ser Thr Gly Arg Gly Arg Gly Met Val Glu Met He 
I 35 40 45 

CCT AAT GCT GAG ACC CTG CGT AAG ATC CAG GTG GAG CAT GGG GTG ACC 192 
Pro Asn Ala Glu Thr Leu Arg Lys He Gin Val Glu His Gly Val Thr 
50 55 60 

GGC TCG TTC AAG GAC CGG CCC CTG GCA GAC CGG CTG CAG AAA CAC AAC 240 
Gly Ser Phe Lys Asp Arg Pro Leu Ala Asp Arg Leu Gin Lys His Asn 
65 70 75 , 80 



CCT GGG GAG GAC GAG TAT GAG AAG GCT GTG GAG AAC TTT ATC TAC TCC 
Pro Gly Glu Asp Glu Tyr Glu Lys Ala Val Glu Asn Phe He Tyr Ser 

85 90 95 



288 



TGC GCT GGC TGC TGC GTG GCC ACG TAC GTC TTG GGC ATC TGT GAC CGA 
Cys Ala Gly Cys Cys Val Ala Thr Tyr Val Leu Gly He Cys Asp Arg 
100 105 HO 



336 
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CAT AAT GAC AAC ATC Alo CTG AAG ACC ACT GGT CAC A'l'G TTC CAC ATC 
His Asn Asp Asn lie Met Leu Lys Thr Thr Gly His Met Phe His lie 
115 120 125 



GAC TTC GGC 
Asp Phe Gly 
130 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

: -Gly Asp Asp Leu Arg Gin Asp Met Leu Thr Leu Gin Met lie Arg lie 
J 1 5 10 15 

?Met Ser Lys lie Trp Val Gin Glu Gly Leu Asp Met Arg Met Val lie 
) 20 25 30 



^Phe Arg Cys Phe Ser Thr Gly Arg Gly Arg Gly Met Val Glu Met lie 
35 40 45 

I Pro Asn Ala Glu Thr Leu Arg Lys lie Gin Val Glu His Gly Val Thr 
| 50 55 60 

Gly Ser Phe Lys Asp Arg Pro Leu Ala Asp Arg Leu Gin Lys His Asn 
65 70 . 75 80 

Pro Gly Glu Asp Glu Tyr Glu Lys Ala Val Glu Asn Phe lie Tyr Ser 

85 90 95 

Cys Ala Gly Cys Cys Val Ala Thr Tyr Val Leu Gly He Cys Asp Arg 
100 105 ' 110 

His Asn Asp Asn He Met Leu Lys Thr Thr Gly His Met Phe His He 
115 120 125 



Asp Phe Gly 
130 




(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Glv Asp Asp Leu Arg Gin Asp Gin Leu Val Val Gin He He Ser Leu 
1 5 10 15 . 

Met Asn Glu Leu Leu Lys Asn Glu Asn Val Asp Leu Lys Leu Thr Pro 
20 25 30 

Tyr Lys He Leu Ala Thr Gly Pro Gin Glu Gly Ala He Glu Phe He 
35 40 45 

Pro Asn Asp Thr Leu Ala Ser He Leu Ser Lys Tyr His Gly He Leu 
i 50 55 60 

1 
i 

,3 Gly Tyr 

S\ 65 , 

^ (2) INFORMATION FOR SEQ ID NO: 43: 

i (i) SEQUENCE CHARACTERISTICS: 

'% (A) LENGTH: 60 amino acids 

: i (B) TYPE: amino acid 

J (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Leu Lys Leu His Tyr Pro Asp Glu Asn Ala Thr Leu Gly Val Gin Gly 
15 10 15 

Trp Val Leu Asp Asn Phe Val Lys Ser Cys Ala Gly Tyr Cys Val He 
20 25 30 

Thr Tyr He Leu Gly Val Gly Asp Arg His Leu Asp Asn Leu Leu Val 
35 40 45 



Thr Pro Asp Gly His Phe Phe His Ala Asp Phe Gly 
50 55 60 




(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Gly Asp Asp Leu Arg Gin Asp Gin Leu lie Leu Gin lie lie Ser Leu 
1 5 10 15 

Met Asp Lys Leu Leu Arg Lys Glu Asn Leu Asp Leu Lys Leu Thr Pro 
20 25 30 

Tyr Lys Val Leu Ala Thr Ser Thr Lys His Gly Phe Met Gin Phe He 
35 40 45 

J Gin Ser Val Pro Val Ala Glu Val Leu Asp Thr Glu Gly Ser He Gin 

50 55 60 



Asn Phe 
65 



fj(2) INFORMATION FOR SEQ ID NO: 45: 



3 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Phe Arq Lys Tyr Ala Pro Ser Glu Asn Gly Pro Asn Gly He Ser Ala 
! y 5 10 15 

Glu Val Met Asp Thr Tyr Val Lys Ser Cys Ala Gly Tyr Cys Val He 
20 25 30 

Thr Tyr He Leu Gly Val Gly Asp Arg His Leu Asp Asn Leu Leu Leu 
35 40 45 



Thr Lys Thr Gly Lys Leu Phe His He Asp Phe Gly 
50 55 60 



INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

Gly Asp Asp Leu Arg Gin Asp Met Leu Thr Leu Gin lie lie Arg lie 
15 10 15 

Met Glu Asn lie Trp Gin Asn Gin Gly Leu Asp Leu Arg Met Leu Pro 
20 25 30 

Tyr Gly Cys Leu Ser lie Gly Asp Cys Val Gly Leu lie Glu Val Val 
35 40 45 

Arg Asn Ser His Thr lie Met Gin lie Gin Cys Lys Gly Gly Leu Lys 
50 55 60 

Gly Ala Leu Gin Phe Asn Ser His Thr Leu His Gin Trp Leu Lys Asp 
65 , 70 75 80 

Lys Asn Lys Gly Glu 

85 



INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

He Tyr Asp Ala Ala He Asp Leu Phe Thr Arg Ser Cys Ala Gly Tyr 
15 10 15 

Cys Val Ala Thr Phe He Leu Gly He Gly Asp Arg His Asn Ser Asn 
20 25 30 



He Met Val Lys Asp Asp Gly Gin Leu Phe His lie Asp Phe Gly 
35 40 45 



# • 

(2) INFORMATION FOR SEQ ID NO: 48: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Gly Asp Asp Leu Arg Gin Asp Met Leu Thr Leu Gin Met lie Arg lie 
15 10 15 

Met Ser Lys He Trp Val Gin Glu Gly Leu Asp Met Arg Met Val He 
20 25 30 

Phe Arg Cys Phe Ser Thr Gly Arg Gly Arg Gly Met Val Glu Met He 
35 40 45 

Pro Asn Ala Glu Thr Leu Arg Lys He Gin Val Glu His Gly Val Thr 
50 55 60 



Gly Ser 
65 



H (2) INFORMATION FOR SEQ ID NO: 49: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Phe Lys Asp Arg Pro Leu Ala Asp Arg Leu Gin Lys His Asn Pro Gly 
15 10 15 

Glu Asp Glu Tyr Glu Lys Ala Val Glu Asn Phe He Tyr Ser Cys Ala 
20 25 30 

Gly Cys Cys Val Ala Thr Tyr Val Leu Gly He Cys Asp Arg His Asn 
35 40 45 

Asp Asn lie Met Leu Lys Thr Thr Gly His Met Phe His He Asp Phe 
50 55 60 



□ 

'=3 



Gly 
65 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Glv Aso Asp Leu Ara Gin Asp Leu Leu Gin He He Met Glu Leu Asp 
15 10 15. 

Leu Pro Tyr Leu Thr Gly Gly He Glu He Asn Gly He Gly Leu Asn 
20 25 30 

He Asp Phe Val Ser Cys Ala Gly Tyr Cys Val Thr Tyr He Leu Gly 
35 40 45 



*- Glv Asp Arq His Asp Asn Gly Leu Phe His lie Asp Phe Gly 

□ 50 55 60 



